Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 9465 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 614 |
| Duplicate rows (%) | 6.1% |
| Total size in memory | 937.6 KiB |
| Average record size in memory | 96.0 B |
Variable types
| Text | 9 |
|---|---|
| Numeric | 1 |
| Categorical | 2 |
| Dataset has 614 (6.1%) duplicate rows | Duplicates |
company_type has 445 (4.5%) missing values | Missing |
employee_count has 254 (2.5%) missing values | Missing |
ownership_status has 7228 (72.3%) missing values | Missing |
company_age has 719 (7.2%) missing values | Missing |
head_quarters has 819 (8.2%) missing values | Missing |
Reproduction
| Analysis started | 2024-12-28 21:05:53.914355 |
|---|---|
| Analysis finished | 2024-12-28 21:05:55.768687 |
| Duration | 1.85 second |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
name
Text
| Distinct | 9368 |
|---|---|
| Distinct (%) | 93.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 50 |
| Mean length | 17.3459 |
| Min length | 2 |
Unique
| Unique | 8736 ? |
|---|---|
| Unique (%) | 87.4% |
Sample
| 1st row | TCS |
|---|---|
| 2nd row | Accenture |
| 3rd row | Cognizant |
| 4th row | Wipro |
| 5th row | ICICI Bank |
| Value | Count | Frequency (%) |
| india | 526 | 2.2% |
| services | 437 | 1.8% |
| solutions | 407 | 1.7% |
| technologies | 379 | 1.6% |
| 336 | 1.4% | |
| group | 281 | 1.2% |
| industries | 242 | 1.0% |
| engineering | 180 | 0.8% |
| systems | 167 | 0.7% |
| and | 164 | 0.7% |
| Other values (9201) | 20511 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14100 | 8.1% |
| 13632 | 7.9% | |
| a | 12815 | 7.4% |
| i | 11740 | 6.8% |
| n | 11319 | 6.5% |
| o | 10442 | 6.0% |
| r | 9705 | 5.6% |
| t | 9190 | 5.3% |
| s | 8780 | 5.1% |
| l | 6653 | 3.8% |
| Other values (77) | 65083 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 173459 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 14100 | 8.1% |
| 13632 | 7.9% | |
| a | 12815 | 7.4% |
| i | 11740 | 6.8% |
| n | 11319 | 6.5% |
| o | 10442 | 6.0% |
| r | 9705 | 5.6% |
| t | 9190 | 5.3% |
| s | 8780 | 5.1% |
| l | 6653 | 3.8% |
| Other values (77) | 65083 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 173459 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 14100 | 8.1% |
| 13632 | 7.9% | |
| a | 12815 | 7.4% |
| i | 11740 | 6.8% |
| n | 11319 | 6.5% |
| o | 10442 | 6.0% |
| r | 9705 | 5.6% |
| t | 9190 | 5.3% |
| s | 8780 | 5.1% |
| l | 6653 | 3.8% |
| Other values (77) | 65083 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 173459 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 14100 | 8.1% |
| 13632 | 7.9% | |
| a | 12815 | 7.4% |
| i | 11740 | 6.8% |
| n | 11319 | 6.5% |
| o | 10442 | 6.0% |
| r | 9705 | 5.6% |
| t | 9190 | 5.3% |
| s | 8780 | 5.1% |
| l | 6653 | 3.8% |
| Other values (77) | 65083 |
rating
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8625 |
| Minimum | 1.2 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 1.2 |
|---|---|
| 5-th percentile | 3.2 |
| Q1 | 3.6 |
| median | 3.9 |
| Q3 | 4.1 |
| 95-th percentile | 4.4 |
| Maximum | 5 |
| Range | 3.8 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.39013968 |
|---|---|
| Coefficient of variation (CV) | 0.10100704 |
| Kurtosis | 1.4444985 |
| Mean | 3.8625 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | -0.62235497 |
| Sum | 38625 |
| Variance | 0.15220897 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=34)
| Value | Count | Frequency (%) |
| 4 | 1129 | |
| 3.9 | 1120 | |
| 4.1 | 1098 | |
| 3.8 | 1047 | |
| 3.7 | 882 | |
| 4.2 | 854 | |
| 3.6 | 617 | 6.2% |
| 3.5 | 580 | 5.8% |
| 4.3 | 557 | 5.6% |
| 3.4 | 404 | 4.0% |
| Other values (24) | 1712 |
| Value | Count | Frequency (%) |
| 1.2 | 1 | < 0.1% |
| 1.6 | 2 | < 0.1% |
| 1.9 | 2 | < 0.1% |
| 2 | 3 | < 0.1% |
| 2.1 | 4 | < 0.1% |
| 2.2 | 5 | 0.1% |
| 2.3 | 5 | 0.1% |
| 2.4 | 12 | |
| 2.5 | 7 | 0.1% |
| 2.6 | 22 |
| Value | Count | Frequency (%) |
| 5 | 7 | 0.1% |
| 4.9 | 24 | 0.2% |
| 4.8 | 45 | 0.4% |
| 4.7 | 73 | 0.7% |
| 4.6 | 111 | 1.1% |
| 4.5 | 194 | 1.9% |
| 4.4 | 320 | 3.2% |
| 4.3 | 557 | |
| 4.2 | 854 | |
| 4.1 | 1098 |
company_type
Text
Missing 
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 445 |
| Missing (%) | 4.5% |
| Memory size | 78.3 KiB |
Length
| Max length | 53 |
|---|---|
| Median length | 28 |
| Mean length | 15.535322 |
| Min length | 3 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | IT Services & Consulting |
|---|---|
| 2nd row | IT Services & Consulting |
| 3rd row | IT Services & Consulting |
| 4th row | IT Services & Consulting |
| 5th row | Banking |
| Value | Count | Frequency (%) |
| 3555 | 17.1% | |
| services | 1680 | 8.1% |
| consulting | 1395 | 6.7% |
| it | 1332 | 6.4% |
| engineering | 496 | 2.4% |
| construction | 496 | 2.4% |
| auto | 455 | 2.2% |
| components | 455 | 2.2% |
| industrial | 445 | 2.1% |
| machinery | 401 | 1.9% |
| Other values (123) | 10122 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 13372 | 9.0% |
| e | 12691 | 8.5% |
| i | 12455 | 8.4% |
| 11277 | 7.6% | |
| t | 9807 | 6.6% |
| r | 8233 | 5.5% |
| a | 7657 | 5.2% |
| o | 7621 | 5.1% |
| s | 7484 | 5.0% |
| c | 6723 | 4.5% |
| Other values (42) | 51120 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 148440 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 13372 | 9.0% |
| e | 12691 | 8.5% |
| i | 12455 | 8.4% |
| 11277 | 7.6% | |
| t | 9807 | 6.6% |
| r | 8233 | 5.5% |
| a | 7657 | 5.2% |
| o | 7621 | 5.1% |
| s | 7484 | 5.0% |
| c | 6723 | 4.5% |
| Other values (42) | 51120 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 148440 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 13372 | 9.0% |
| e | 12691 | 8.5% |
| i | 12455 | 8.4% |
| 11277 | 7.6% | |
| t | 9807 | 6.6% |
| r | 8233 | 5.5% |
| a | 7657 | 5.2% |
| o | 7621 | 5.1% |
| s | 7484 | 5.0% |
| c | 6723 | 4.5% |
| Other values (42) | 51120 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 148440 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 13372 | 9.0% |
| e | 12691 | 8.5% |
| i | 12455 | 8.4% |
| 11277 | 7.6% | |
| t | 9807 | 6.6% |
| r | 8233 | 5.5% |
| a | 7657 | 5.2% |
| o | 7621 | 5.1% |
| s | 7484 | 5.0% |
| c | 6723 | 4.5% |
| Other values (42) | 51120 |
employee_count
Categorical
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 254 |
| Missing (%) | 2.5% |
| Memory size | 78.3 KiB |
| 1k-5k Employees | |
|---|---|
| 201-500 Employees | |
| 501-1k Employees | |
| 51-200 Employees | |
| 5k-10k Employees | |
| Other values (5) |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 15.961625 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 Lakh+ Employees |
|---|---|
| 2nd row | 1 Lakh+ Employees |
| 3rd row | 1 Lakh+ Employees |
| 4th row | 1 Lakh+ Employees |
| 5th row | 1 Lakh+ Employees |
Common Values
| Value | Count | Frequency (%) |
| 1k-5k Employees | 2665 | |
| 201-500 Employees | 2116 | |
| 501-1k Employees | 1845 | |
| 51-200 Employees | 1640 | |
| 5k-10k Employees | 504 | 5.0% |
| 10k-50k Employees | 487 | 4.9% |
| 11-50 Employees | 315 | 3.1% |
| 1-10 Employees | 88 | 0.9% |
| 1 Lakh+ Employees | 55 | 0.5% |
| 50k-1 Lakh Employees | 31 | 0.3% |
| (Missing) | 254 | 2.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| employees | 9746 | |
| 1k-5k | 2665 | 13.6% |
| 201-500 | 2116 | 10.8% |
| 501-1k | 1845 | 9.4% |
| 51-200 | 1640 | 8.4% |
| 5k-10k | 504 | 2.6% |
| 10k-50k | 487 | 2.5% |
| 11-50 | 315 | 1.6% |
| 1-10 | 88 | 0.4% |
| lakh | 86 | 0.4% |
| Other values (2) | 86 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 19492 | |
| 0 | 13385 | 8.6% |
| 1 | 11994 | 7.7% |
| 9832 | 6.3% | |
| y | 9746 | 6.3% |
| s | 9746 | 6.3% |
| E | 9746 | 6.3% |
| o | 9746 | 6.3% |
| l | 9746 | 6.3% |
| m | 9746 | 6.3% |
| Other values (9) | 42383 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 155562 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 19492 | |
| 0 | 13385 | 8.6% |
| 1 | 11994 | 7.7% |
| 9832 | 6.3% | |
| y | 9746 | 6.3% |
| s | 9746 | 6.3% |
| E | 9746 | 6.3% |
| o | 9746 | 6.3% |
| l | 9746 | 6.3% |
| m | 9746 | 6.3% |
| Other values (9) | 42383 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 155562 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 19492 | |
| 0 | 13385 | 8.6% |
| 1 | 11994 | 7.7% |
| 9832 | 6.3% | |
| y | 9746 | 6.3% |
| s | 9746 | 6.3% |
| E | 9746 | 6.3% |
| o | 9746 | 6.3% |
| l | 9746 | 6.3% |
| m | 9746 | 6.3% |
| Other values (9) | 42383 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 155562 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 19492 | |
| 0 | 13385 | 8.6% |
| 1 | 11994 | 7.7% |
| 9832 | 6.3% | |
| y | 9746 | 6.3% |
| s | 9746 | 6.3% |
| E | 9746 | 6.3% |
| o | 9746 | 6.3% |
| l | 9746 | 6.3% |
| m | 9746 | 6.3% |
| Other values (9) | 42383 |
ownership_status
Categorical
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 7228 |
| Missing (%) | 72.3% |
| Memory size | 78.3 KiB |
| Public | |
|---|---|
| Startup | |
| Forbes Global 2000 | |
| Fortune India 500 | 116 |
| Conglomerate | 96 |
| Other values (4) | 160 |
Length
| Max length | 18 |
|---|---|
| Median length | 6 |
| Mean length | 8.2316017 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Public |
|---|---|
| 2nd row | Public |
| 3rd row | Forbes Global 2000 |
| 4th row | Public |
| 5th row | Public |
Common Values
| Value | Count | Frequency (%) |
| Public | 1778 | 17.8% |
| Startup | 340 | 3.4% |
| Forbes Global 2000 | 282 | 2.8% |
| Fortune India 500 | 116 | 1.2% |
| Conglomerate | 96 | 1.0% |
| Indian Unicorn | 78 | 0.8% |
| Central | 43 | 0.4% |
| State | 30 | 0.3% |
| MNC | 9 | 0.1% |
| (Missing) | 7228 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| public | 1778 | |
| startup | 340 | 9.3% |
| forbes | 282 | 7.7% |
| global | 282 | 7.7% |
| 2000 | 282 | 7.7% |
| fortune | 116 | 3.2% |
| india | 116 | 3.2% |
| 500 | 116 | 3.2% |
| conglomerate | 96 | 2.6% |
| indian | 78 | 2.1% |
| Other values (4) | 160 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2481 | |
| b | 2342 | 10.3% |
| u | 2234 | 9.8% |
| i | 2050 | 9.0% |
| c | 1856 | 8.1% |
| P | 1778 | 7.8% |
| 0 | 1078 | 4.7% |
| t | 995 | 4.4% |
| a | 985 | 4.3% |
| r | 955 | 4.2% |
| Other values (19) | 6064 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22818 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 2481 | |
| b | 2342 | 10.3% |
| u | 2234 | 9.8% |
| i | 2050 | 9.0% |
| c | 1856 | 8.1% |
| P | 1778 | 7.8% |
| 0 | 1078 | 4.7% |
| t | 995 | 4.4% |
| a | 985 | 4.3% |
| r | 955 | 4.2% |
| Other values (19) | 6064 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22818 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 2481 | |
| b | 2342 | 10.3% |
| u | 2234 | 9.8% |
| i | 2050 | 9.0% |
| c | 1856 | 8.1% |
| P | 1778 | 7.8% |
| 0 | 1078 | 4.7% |
| t | 995 | 4.4% |
| a | 985 | 4.3% |
| r | 955 | 4.2% |
| Other values (19) | 6064 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22818 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 2481 | |
| b | 2342 | 10.3% |
| u | 2234 | 9.8% |
| i | 2050 | 9.0% |
| c | 1856 | 8.1% |
| P | 1778 | 7.8% |
| 0 | 1078 | 4.7% |
| t | 995 | 4.4% |
| a | 985 | 4.3% |
| r | 955 | 4.2% |
| Other values (19) | 6064 |
company_age
Text
Missing 
| Distinct | 216 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 719 |
| Missing (%) | 7.2% |
| Memory size | 78.3 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.970477 |
| Min length | 11 |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 55 years old |
|---|---|
| 2nd row | 34 years old |
| 3rd row | 29 years old |
| 4th row | 78 years old |
| 5th row | 29 years old |
| Value | Count | Frequency (%) |
| years | 9281 | |
| old | 9281 | |
| 16 | 320 | 1.1% |
| 17 | 280 | 1.0% |
| 23 | 273 | 1.0% |
| 15 | 253 | 0.9% |
| 24 | 236 | 0.8% |
| 13 | 236 | 0.8% |
| 19 | 231 | 0.8% |
| 27 | 230 | 0.8% |
| Other values (208) | 7222 |
Most occurring characters
| Value | Count | Frequency (%) |
| 18562 | ||
| e | 9281 | |
| y | 9281 | |
| s | 9281 | |
| o | 9281 | |
| a | 9281 | |
| r | 9281 | |
| l | 9281 | |
| d | 9281 | |
| 1 | 3762 | 3.4% |
| Other values (9) | 14526 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 111098 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 18562 | ||
| e | 9281 | |
| y | 9281 | |
| s | 9281 | |
| o | 9281 | |
| a | 9281 | |
| r | 9281 | |
| l | 9281 | |
| d | 9281 | |
| 1 | 3762 | 3.4% |
| Other values (9) | 14526 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 111098 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 18562 | ||
| e | 9281 | |
| y | 9281 | |
| s | 9281 | |
| o | 9281 | |
| a | 9281 | |
| r | 9281 | |
| l | 9281 | |
| d | 9281 | |
| 1 | 3762 | 3.4% |
| Other values (9) | 14526 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 111098 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 18562 | ||
| e | 9281 | |
| y | 9281 | |
| s | 9281 | |
| o | 9281 | |
| a | 9281 | |
| r | 9281 | |
| l | 9281 | |
| d | 9281 | |
| 1 | 3762 | 3.4% |
| Other values (9) | 14526 |
head_quarters
Text
Missing 
| Distinct | 1174 |
|---|---|
| Distinct (%) | 12.8% |
| Missing | 819 |
| Missing (%) | 8.2% |
| Memory size | 78.3 KiB |
Length
| Max length | 39 |
|---|---|
| Median length | 26 |
| Mean length | 8.7600479 |
| Min length | 3 |
Unique
| Unique | 711 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | Mumbai |
|---|---|
| 2nd row | Dublin |
| 3rd row | Teaneck. New Jersey. |
| 4th row | Bangalore/Bengaluru |
| 5th row | Mumbai |
| Value | Count | Frequency (%) |
| mumbai | 1414 | 13.8% |
| new | 484 | 4.7% |
| chennai | 460 | 4.5% |
| delhi | 435 | 4.2% |
| noida | 415 | 4.0% |
| delhi/ncr | 353 | 3.4% |
| pune | 344 | 3.3% |
| bangalore/bengaluru | 330 | 3.2% |
| gurgaon/gurugram | 295 | 2.9% |
| kolkata | 284 | 2.8% |
| Other values (1236) | 5455 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11307 | 14.1% |
| e | 6047 | 7.5% |
| u | 5408 | 6.7% |
| n | 5216 | 6.5% |
| r | 5146 | 6.4% |
| i | 5013 | 6.2% |
| o | 3974 | 4.9% |
| d | 3386 | 4.2% |
| l | 3345 | 4.2% |
| b | 2847 | 3.5% |
| Other values (64) | 28737 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 80426 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 11307 | 14.1% |
| e | 6047 | 7.5% |
| u | 5408 | 6.7% |
| n | 5216 | 6.5% |
| r | 5146 | 6.4% |
| i | 5013 | 6.2% |
| o | 3974 | 4.9% |
| d | 3386 | 4.2% |
| l | 3345 | 4.2% |
| b | 2847 | 3.5% |
| Other values (64) | 28737 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 80426 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 11307 | 14.1% |
| e | 6047 | 7.5% |
| u | 5408 | 6.7% |
| n | 5216 | 6.5% |
| r | 5146 | 6.4% |
| i | 5013 | 6.2% |
| o | 3974 | 4.9% |
| d | 3386 | 4.2% |
| l | 3345 | 4.2% |
| b | 2847 | 3.5% |
| Other values (64) | 28737 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 80426 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 11307 | 14.1% |
| e | 6047 | 7.5% |
| u | 5408 | 6.7% |
| n | 5216 | 6.5% |
| r | 5146 | 6.4% |
| i | 5013 | 6.2% |
| o | 3974 | 4.9% |
| d | 3386 | 4.2% |
| l | 3345 | 4.2% |
| b | 2847 | 3.5% |
| Other values (64) | 28737 |
reviews
Text
| Distinct | 881 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 2.73 |
| Min length | 2 |
Unique
| Unique | 228 ? |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 66.7k |
|---|---|
| 2nd row | 42.5k |
| 3rd row | 38.4k |
| 4th row | 35.4k |
| 5th row | 30.9k |
| Value | Count | Frequency (%) |
| 72 | 135 | 1.4% |
| 67 | 134 | 1.3% |
| 73 | 131 | 1.3% |
| 69 | 130 | 1.3% |
| 71 | 128 | 1.3% |
| 70 | 125 | 1.2% |
| 68 | 124 | 1.2% |
| 77 | 119 | 1.2% |
| 81 | 112 | 1.1% |
| 75 | 110 | 1.1% |
| Other values (871) | 8752 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5707 | |
| 2 | 3108 | |
| 7 | 2740 | |
| 8 | 2355 | |
| 3 | 2264 | 8.3% |
| 9 | 2238 | 8.2% |
| 6 | 2202 | 8.1% |
| 4 | 1933 | 7.1% |
| 0 | 1926 | 7.1% |
| 5 | 1630 | 6.0% |
| Other values (2) | 1197 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27300 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5707 | |
| 2 | 3108 | |
| 7 | 2740 | |
| 8 | 2355 | |
| 3 | 2264 | 8.3% |
| 9 | 2238 | 8.2% |
| 6 | 2202 | 8.1% |
| 4 | 1933 | 7.1% |
| 0 | 1926 | 7.1% |
| 5 | 1630 | 6.0% |
| Other values (2) | 1197 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 27300 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5707 | |
| 2 | 3108 | |
| 7 | 2740 | |
| 8 | 2355 | |
| 3 | 2264 | 8.3% |
| 9 | 2238 | 8.2% |
| 6 | 2202 | 8.1% |
| 4 | 1933 | 7.1% |
| 0 | 1926 | 7.1% |
| 5 | 1630 | 6.0% |
| Other values (2) | 1197 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 27300 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5707 | |
| 2 | 3108 | |
| 7 | 2740 | |
| 8 | 2355 | |
| 3 | 2264 | 8.3% |
| 9 | 2238 | 8.2% |
| 6 | 2202 | 8.1% |
| 4 | 1933 | 7.1% |
| 0 | 1926 | 7.1% |
| 5 | 1630 | 6.0% |
| Other values (2) | 1197 | 4.4% |
salaries
Text
| Distinct | 1219 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.3006 |
| Min length | 1 |
Unique
| Unique | 180 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 734.8k |
|---|---|
| 2nd row | 513.4k |
| 3rd row | 496.8k |
| 4th row | 370.3k |
| 5th row | 136.1k |
| Value | Count | Frequency (%) |
| 1.1k | 331 | 3.3% |
| 1.2k | 265 | 2.6% |
| 1.3k | 240 | 2.4% |
| 1.4k | 205 | 2.1% |
| 1k | 193 | 1.9% |
| 1.5k | 168 | 1.7% |
| 1.7k | 159 | 1.6% |
| 1.6k | 144 | 1.4% |
| 1.9k | 97 | 1.0% |
| 1.8k | 97 | 1.0% |
| Other values (1209) | 8101 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4103 | |
| k | 3720 | |
| . | 3302 | |
| 4 | 3015 | |
| 3 | 3009 | |
| 2 | 2845 | |
| 5 | 2774 | |
| 6 | 2540 | |
| 7 | 2312 | |
| 8 | 2106 | |
| Other values (2) | 3280 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 33006 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4103 | |
| k | 3720 | |
| . | 3302 | |
| 4 | 3015 | |
| 3 | 3009 | |
| 2 | 2845 | |
| 5 | 2774 | |
| 6 | 2540 | |
| 7 | 2312 | |
| 8 | 2106 | |
| Other values (2) | 3280 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 33006 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4103 | |
| k | 3720 | |
| . | 3302 | |
| 4 | 3015 | |
| 3 | 3009 | |
| 2 | 2845 | |
| 5 | 2774 | |
| 6 | 2540 | |
| 7 | 2312 | |
| 8 | 2106 | |
| Other values (2) | 3280 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 33006 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4103 | |
| k | 3720 | |
| . | 3302 | |
| 4 | 3015 | |
| 3 | 3009 | |
| 2 | 2845 | |
| 5 | 2774 | |
| 6 | 2540 | |
| 7 | 2312 | |
| 8 | 2106 | |
| Other values (2) | 3280 |
interviews
Text
| Distinct | 281 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.4498 |
| Min length | 1 |
Unique
| Unique | 106 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 5.6k |
|---|---|
| 2nd row | 3.8k |
| 3rd row | 3.3k |
| 4th row | 3.3k |
| 5th row | 1.7k |
| Value | Count | Frequency (%) |
| 3 | 837 | 8.4% |
| 4 | 826 | 8.3% |
| 5 | 777 | 7.8% |
| 2 | 754 | 7.5% |
| 6 | 627 | 6.3% |
| 1 | 588 | 5.9% |
| 7 | 557 | 5.6% |
| 8 | 457 | 4.6% |
| 9 | 411 | 4.1% |
| 10 | 319 | 3.2% |
| Other values (271) | 3847 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3287 | |
| 2 | 1945 | |
| 3 | 1666 | |
| 4 | 1478 | |
| 5 | 1296 | 8.9% |
| 6 | 1081 | 7.5% |
| 7 | 945 | 6.5% |
| 8 | 782 | 5.4% |
| 9 | 746 | 5.1% |
| 0 | 624 | 4.3% |
| Other values (3) | 648 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14498 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3287 | |
| 2 | 1945 | |
| 3 | 1666 | |
| 4 | 1478 | |
| 5 | 1296 | 8.9% |
| 6 | 1081 | 7.5% |
| 7 | 945 | 6.5% |
| 8 | 782 | 5.4% |
| 9 | 746 | 5.1% |
| 0 | 624 | 4.3% |
| Other values (3) | 648 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14498 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3287 | |
| 2 | 1945 | |
| 3 | 1666 | |
| 4 | 1478 | |
| 5 | 1296 | 8.9% |
| 6 | 1081 | 7.5% |
| 7 | 945 | 6.5% |
| 8 | 782 | 5.4% |
| 9 | 746 | 5.1% |
| 0 | 624 | 4.3% |
| Other values (3) | 648 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14498 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 3287 | |
| 2 | 1945 | |
| 3 | 1666 | |
| 4 | 1478 | |
| 5 | 1296 | 8.9% |
| 6 | 1081 | 7.5% |
| 7 | 945 | 6.5% |
| 8 | 782 | 5.4% |
| 9 | 746 | 5.1% |
| 0 | 624 | 4.3% |
| Other values (3) | 648 | 4.5% |
jobs
Text
| Distinct | 296 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 1.7261 |
| Min length | 1 |
Unique
| Unique | 121 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | 213 |
|---|---|
| 2nd row | 4.1k |
| 3rd row | 497 |
| 4th row | 316 |
| 5th row | 214 |
| Value | Count | Frequency (%) |
| 4054 | ||
| 1 | 719 | 7.2% |
| 2 | 548 | 5.5% |
| 3 | 415 | 4.2% |
| 4 | 327 | 3.3% |
| 5 | 279 | 2.8% |
| 6 | 242 | 2.4% |
| 8 | 208 | 2.1% |
| 7 | 197 | 2.0% |
| 9 | 168 | 1.7% |
| Other values (286) | 2843 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 8108 | |
| 1 | 2388 | 13.8% |
| 2 | 1431 | 8.3% |
| 3 | 1122 | 6.5% |
| 4 | 861 | 5.0% |
| 5 | 714 | 4.1% |
| 6 | 651 | 3.8% |
| 8 | 553 | 3.2% |
| 7 | 551 | 3.2% |
| 9 | 436 | 2.5% |
| Other values (3) | 446 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17261 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 8108 | |
| 1 | 2388 | 13.8% |
| 2 | 1431 | 8.3% |
| 3 | 1122 | 6.5% |
| 4 | 861 | 5.0% |
| 5 | 714 | 4.1% |
| 6 | 651 | 3.8% |
| 8 | 553 | 3.2% |
| 7 | 551 | 3.2% |
| 9 | 436 | 2.5% |
| Other values (3) | 446 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17261 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 8108 | |
| 1 | 2388 | 13.8% |
| 2 | 1431 | 8.3% |
| 3 | 1122 | 6.5% |
| 4 | 861 | 5.0% |
| 5 | 714 | 4.1% |
| 6 | 651 | 3.8% |
| 8 | 553 | 3.2% |
| 7 | 551 | 3.2% |
| 9 | 436 | 2.5% |
| Other values (3) | 446 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17261 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 8108 | |
| 1 | 2388 | 13.8% |
| 2 | 1431 | 8.3% |
| 3 | 1122 | 6.5% |
| 4 | 861 | 5.0% |
| 5 | 714 | 4.1% |
| 6 | 651 | 3.8% |
| 8 | 553 | 3.2% |
| 7 | 551 | 3.2% |
| 9 | 436 | 2.5% |
| Other values (3) | 446 | 2.6% |
benefits
Text
| Distinct | 473 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 1.906 |
| Min length | 1 |
Unique
| Unique | 182 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 11.3k |
|---|---|
| 2nd row | 7k |
| 3rd row | 5.7k |
| 4th row | 4.9k |
| 5th row | 3.7k |
| Value | Count | Frequency (%) |
| 9 | 414 | 4.1% |
| 11 | 405 | 4.0% |
| 13 | 399 | 4.0% |
| 12 | 384 | 3.8% |
| 10 | 376 | 3.8% |
| 8 | 345 | 3.5% |
| 14 | 335 | 3.4% |
| 15 | 333 | 3.3% |
| 7 | 308 | 3.1% |
| 6 | 276 | 2.8% |
| Other values (463) | 6425 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4936 | |
| 2 | 2823 | |
| 3 | 2008 | |
| 4 | 1574 | 8.3% |
| 5 | 1469 | 7.7% |
| 6 | 1356 | 7.1% |
| 9 | 1228 | 6.4% |
| 8 | 1214 | 6.4% |
| 7 | 1199 | 6.3% |
| 0 | 1079 | 5.7% |
| Other values (3) | 174 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 19060 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4936 | |
| 2 | 2823 | |
| 3 | 2008 | |
| 4 | 1574 | 8.3% |
| 5 | 1469 | 7.7% |
| 6 | 1356 | 7.1% |
| 9 | 1228 | 6.4% |
| 8 | 1214 | 6.4% |
| 7 | 1199 | 6.3% |
| 0 | 1079 | 5.7% |
| Other values (3) | 174 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 19060 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4936 | |
| 2 | 2823 | |
| 3 | 2008 | |
| 4 | 1574 | 8.3% |
| 5 | 1469 | 7.7% |
| 6 | 1356 | 7.1% |
| 9 | 1228 | 6.4% |
| 8 | 1214 | 6.4% |
| 7 | 1199 | 6.3% |
| 0 | 1079 | 5.7% |
| Other values (3) | 174 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 19060 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 4936 | |
| 2 | 2823 | |
| 3 | 2008 | |
| 4 | 1574 | 8.3% |
| 5 | 1469 | 7.7% |
| 6 | 1356 | 7.1% |
| 9 | 1228 | 6.4% |
| 8 | 1214 | 6.4% |
| 7 | 1199 | 6.3% |
| 0 | 1079 | 5.7% |
| Other values (3) | 174 | 0.9% |
Interactions
Correlations
| employee_count | ownership_status | rating | |
|---|---|---|---|
| employee_count | 1.000 | 0.135 | 0.081 |
| ownership_status | 0.135 | 1.000 | 0.144 |
| rating | 0.081 | 0.144 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Sample
| name | rating | company_type | employee_count | ownership_status | company_age | head_quarters | reviews | salaries | interviews | jobs | benefits | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | TCS | 3.8 | IT Services & Consulting | 1 Lakh+ Employees | Public | 55 years old | Mumbai | 66.7k | 734.8k | 5.6k | 213 | 11.3k |
| 1 | Accenture | 4.1 | IT Services & Consulting | 1 Lakh+ Employees | Public | 34 years old | Dublin | 42.5k | 513.4k | 3.8k | 4.1k | 7k |
| 2 | Cognizant | 3.9 | IT Services & Consulting | 1 Lakh+ Employees | Forbes Global 2000 | 29 years old | Teaneck. New Jersey. | 38.4k | 496.8k | 3.3k | 497 | 5.7k |
| 3 | Wipro | 3.8 | IT Services & Consulting | 1 Lakh+ Employees | Public | 78 years old | Bangalore/Bengaluru | 35.4k | 370.3k | 3.3k | 316 | 4.9k |
| 4 | ICICI Bank | 4.0 | Banking | 1 Lakh+ Employees | Public | 29 years old | Mumbai | 30.9k | 136.1k | 1.7k | 214 | 3.7k |
| 5 | HDFC Bank | 3.9 | Banking | 1 Lakh+ Employees | Public | 29 years old | Mumbai | 30.6k | 123.6k | 1.4k | 376 | 3.2k |
| 6 | Infosys | 3.9 | IT Services & Consulting | 1 Lakh+ Employees | Public | 42 years old | Bengaluru/Bangalore | 29.1k | 413.1k | 4.5k | 779 | 5k |
| 7 | Capgemini | 3.8 | IT Services & Consulting | 1 Lakh+ Employees | Public | 56 years old | Paris | 27k | 336.6k | 2.3k | 512 | 3.9k |
| 8 | Tech Mahindra | 3.7 | IT Services & Consulting | 1 Lakh+ Employees | Public | 37 years old | Pune | 25.3k | 236.3k | 2.2k | 1.1k | 3.5k |
| 9 | HCLTech | 3.7 | IT Services & Consulting | 1 Lakh+ Employees | Public | 32 years old | Noida | 24.8k | 251.9k | 2.2k | 574 | 4k |
| name | rating | company_type | employee_count | ownership_status | company_age | head_quarters | reviews | salaries | interviews | jobs | benefits | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | Calcutta High Court | 4.5 | Law Enforcement & Security | Public | NaN | Public | 161 years old | Kolkata | 66 | 578 | 1 | -- | 4 |
| 9991 | TSMT Technology India | 2.8 | Semiconductors | 201-500 Employees | NaN | 26 years old | Taoyuan | 66 | 519 | 2 | 7 | 8 |
| 9992 | Contizant Technologies | 4.1 | IT Services & Consulting | 11-50 Employees | NaN | 5 years old | Gurgaon/Gurugram | 66 | 385 | 13 | -- | 17 |
| 9993 | JBM Auto Limited Bus Division | 3.1 | NaN | NaN | NaN | NaN | NaN | 66 | 173 | 9 | -- | 5 |
| 9994 | Ecolog International | 4.6 | Logistics | 10k-50k Employees | NaN | 20 years old | Düsseldorf | 66 | 62 | -- | -- | 14 |
| 9995 | Advocate | 4.3 | IT Services & Consulting | 201-500 Employees | NaN | 22 years old | Atlanta | 66 | 605 | 4 | -- | 9 |
| 9996 | Adamas University | 3.0 | Education & Training | 51-200 Employees | NaN | 9 years old | Kolkata | 66 | 342 | 6 | -- | 7 |
| 9997 | Nagarjuna Cements | 4.0 | Engineering & Construction | 501-1k Employees | NaN | 28 years old | Hyderabad | 66 | 381 | 2 | -- | 7 |
| 9998 | Cumi Murugappa | 4.1 | NaN | NaN | NaN | NaN | NaN | 66 | 476 | 5 | -- | 5 |
| 9999 | Success Pact Consulting | 3.1 | Recruitment | 51-200 Employees | NaN | 12 years old | Noida | 66 | 223 | -- | 117 | 13 |
Duplicate rows
Most frequently occurring
| name | rating | company_type | employee_count | ownership_status | company_age | head_quarters | reviews | salaries | interviews | jobs | benefits | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20Cube Logistics | 3.6 | Logistics | 51-200 Employees | NaN | 12 years old | Singapore | 106 | 731 | 10 | 50 | 15 | 2 |
| 1 | 6Sense | 3.9 | Software Product | 201-500 Employees | Startup | 10 years old | San Francisco | 72 | 304 | 3 | 24 | 40 | 2 |
| 2 | A.t.e. Enterprises | 4.0 | Industrial Machinery | 1k-5k Employees | NaN | 84 years old | Mumbai | 94 | 667 | 6 | 6 | 11 | 2 |
| 3 | ABB GISL | 4.2 | Electrical Equipment | 1k-5k Employees | NaN | NaN | NaN | 91 | 659 | 4 | -- | 24 | 2 |
| 4 | ACN Health care | 3.5 | Healthcare | 11-50 Employees | NaN | 12 years old | Bangalore/Bengaluru | 119 | 786 | 1 | -- | 12 | 2 |
| 5 | AG&P Pratham | 3.9 | NaN | NaN | NaN | NaN | NaN | 75 | 366 | 12 | -- | 9 | 2 |
| 6 | AGROCEL INDUSTRIES | 4.5 | Chemicals | 51-200 Employees | NaN | 38 years old | Mumbai | 103 | 474 | 7 | -- | 9 | 2 |
| 7 | ANAAMALAIS TOYOTA | 4.3 | Financial Services | 1k-5k Employees | NaN | 23 years old | Coimbatore | 68 | 245 | 3 | -- | 10 | 2 |
| 8 | ASC Technology Solutions | 4.6 | NaN | 51-200 Employees | NaN | NaN | NaN | 71 | 82 | 2 | -- | 3 | 2 |
| 9 | ATC Telecom Infrastructure | 4.1 | Telecom | 201-500 Employees | NaN | 19 years old | New Delhi | 197 | 753 | 7 | -- | 27 | 2 |